Mon. Not. R. Astron. Soc. OOP. [TlfTSl C2012') Printed 3 March 2013 (MN 1^1^ style file v2.2) 



Large Scale Structure in Absorption: Gas within and 
around Galaxy Voids 

^ Nicolas Tejos^"*", Simon L. Morris^, Neil H. M. Crighton^, Tom Theuns^ Gabriel 
O Altay\ Charles W. Finn^ 

' ^Department of Physics, Durham University, South Road, Durham, DHl 3LE, UK. 
Max Planck Institute for Astronomy, Kdnigstuhl 17, D- 691 17 Heidelberg, Germany. 



Draft version 



ABSTRACT 

We investigate the properties of the H i Ly-a absorption systems (Ly-a forest) within 
and around galaxy voids at z < 0.1. We find a significant excess (> 99 per cent confi- 
dence level, c.l.) of Ly-a systems at the edges of galaxy voids with respect to a random 
distribution, on ~ 5 h~^Mpc scales. We find no significant difference in the number 
of systems inside voids with respect to the random expectation. We report differences 
between both column density (iVui) and Doppler parameter (6hi) distributions of Ly- 
a systems found inside and at the edge of galaxy voids at the > 98 and > 90 per 
cent c.l. respectively. Low density environments (voids) have smaller values for both 
iVni and 5hi than higher density ones (edges of voids). These trends are theoreti- 
cally expected and also found in GIMIC, a state-of-the-art hydrodynamical simulation. 
Our findings are consistent with a scenario of at least three types of Ly-a systems: 
(1) containing embedded galaxies and so directly correlated with galaxies (referred 
as 'halo-like'), (2) correlated with galaxies only because they lie in the same over- 
dense LSS, and (3) associated with under-dense LSS with a very low auto-correlation 
amplitude (« random) that are not correlated with luminous galaxies. We argue the 
latter arise in structures still growing linearly from the primordial density fluctuations 
inside galaxy voids that have not formed galaxies because of their low densities. We 
estimate that these under-dense LSS absorbers account for 25 — 30 ± 6 per cent of the 
current Ly-a population (A^hi ^ lO^'^'^ cm^^) while the other two types account for 
the remaining 70 — 75 ± 12 per cent. Assuming that only iVni ^ 10^^ cm~^ systems 
have embedded galaxies nearby, we have estimated the contribution of the 'halo-like' 
Ly-a population to be « 12 — 15 ± 4 per cent and consequently « 55 — 60 ± 13 per 
cent of the Ly-a systems to be associated with the over-dense LSS. 

Key words: intergalactic medium: Ly-a forest -large-scale structure of the universe: 
galaxy voids, galaxy filaments -quasars: absorption lines 



1 INTRODUCTION 



The inter-galactic medium (IGM ) hosts the main reser- 
voirs of baryons at all epochs (see IProchaska &; TumlinsonI 
l2009l for a review). This is supported by both observa- 
tions (e.g. iFukueita et al.|[l99i : iFukugita fc Peebles! l2004l: 



Shull et al. _ 201ll'l and s i mulations (e.g., ICen fc Ostrikeij 



19991 : iTheuns et al.|[l999l : iDave et"alll2010l ). Efficient feed 



back mechanisms that expel material from galaxies to the 
IGM are required to e xplain the statistical properties of the 
observed galaxies ( e.g., lBaugh et al.ll2005l :[ Bower etiil ]|2006l : 
ISchave et al.|[201ol ). Given that galaxies are formed by ac- 
creting gas from the IGM, a continuous interplay between 
the IGM and galaxies is then in place. Consequently, un- 
derstanding the relationship between the IGM and galax- 
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ies is key to understanding galaxy formation and evolution. 
This has been recognized since the earliest Hubble Space 
Telescope (HST) spectroscopy of QSOs, where the asso- 
ciation between low-z IGM absorption systems and galax- 
ies w a s investigated for t h e first time (e.g.. [Spin rad ct al] 
19931: iMorris et al.l [1993: iMorris fc van deuBerghl 1199^ : 



Stocke et al.lll995l : lLanzetta et al.lll995l ') 



The large scale environment in wh ich matter resides is 
also important, as it is predicted (e.g.. iBorgani et aL I I2OO2I : 



Padilla et al.l |2009|') and observed (e .g.. Lewis et"all 



2OO2I : 



Lopez et al.ll200^ : iPadilla et al.ll2010l ) to have non negligible 



effects on the gas and galaxy properties. Given that baryonic 
matter is expected to fall into the considerably deeper gravi- 
tational potentials of dark matter, the IGM gas and galaxies 
are expected to be predominantly found at such lo cations 
forming the so called 'cosmic web' (|Bond et al.lll99^ V Iden- 
tification of large scale structures (LSS) like galaxy clusters. 
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filaments or voids and tlieir influence over the IGM and 
galaxies is then fundamental to a complete picture of the 
IGM/galaxy connection and its evolution over cosmic time. 

With the adve nt of big galaxy surveys such as the 2dF 
JColless et al.ll200ll') o r the Sloan Digital Sky Survey (SDSS, 
lAbazaiian et al .I I2OO9I ) it has been possible to directly ob- 
serve the nature and extent of the distribution of stellar 
matter in the local universe. Galaxies tend to lie in the fila- 
mentary structure which simulations predict, however, very 
little is known about the actual gas distribution at \aw-z. In 
this work we focus on the study of H I Ly-a (hereafter re- 
ferred simply as Ly-a) absorption systems found within and 
around galaxy voids at 2: < 0.1. 

Galaxy voids are the best candidates to start our 
statistical study of LSS in absorption. Voids account 
for up to 60 — 8 0% of the volume of the universe 
at z = (e.g., lAragon-Calvo et all I2OI0I : |Pan et al.1 
I2OI2I ). Some studies have suggested that when a min- 
imum density threshold is reach ed, voids grow in a 
spherically symmetric way (e.g., Regos fc Gelle"il Il99ll : 



Ivan de Weveaert fc van Kampenlll993l l. This suggests that 
voids have a relatively simple geometry, which makes them 
comparatively easy to defi ne and identify fro m current 
galaxy surveys (although see IColberg et aLlboOSl . for a dis- 
cussion on different void finder algorithms). Galaxy voids 
are a unique environment in which to look for evidence 
of early (or even p rimordial) enrichment of the IGM (e.g., 
IStocke et al.l [20071 ). It is interesting that galaxy voids are 
present even in the distribution of low mass galaxies (e.g., 
|Peeblej[200ll : iTikhonov fc KlvpinI 12009') and so there must 
be mechanisms that prevent galaxies from forming in such 
low density environments. 

Previous studies of Ly-a absorption systems associated 
with voids at low-z have r elied on a 'nea r est ga l axy distance' 



(NGD) definition, (e.g iPenton et"ai] I2OO2I : IStocke et all 



I2OO7I : IWakker fc SavagelbOOgD . In order to have a clean def- 
inition of void absorbers the N GD must be l arge, leading 
to small samples. For instance, iPenton et al.l (I2OO2I ') found 
only 8 void absorbers (from a total of 46 systems) de- 
fined as bein g located at > 3 h^^ Mp c from the nearest 
> L* galaxy. IWakker fc Savage! (|2009l ) found 17 void ab- 
sorbers (from a tot al of 102) based on the same definition. 
IStocke et al.l |2007') had to relax the previous limit to > 1.4 
h^Q Mpc in order to find 61 void absorbers (from a total of 
651 systems), although only 12 were used in their study on 
void metallicities. Note that a low NGD limit (of 1.4 h^^ 
Mpc) could introduce some contamination of not-void ab- 
sorbers. This is because filaments in the 'cosmic web' are ex- 
pected to be a couple of Mpc in radius ([Gonzalez fc PadiHal 
I2OIOI : lAragon-Calvo et al.|[201ol : [Bond et al.ll2010t ). Consid^ 
ering the Local Group as an example, being 1.4 h^^ Mpc 
away from either the Milky Way or Andromeda cannot 
be considered as being in a galaxy void. On the other 
hand, give n that there is a population of galaxies inside 
voids fe.g.. lRoias et al.ll2005l : |Park et al.ll2007i : lKreckel et al.1 
120111 '). the NGD definition could also miss some 'real' void 
absorbers relatively close to bright isolated galaxies. In fact, 
IWakker fc Savage! (|2009l ) found that there may be no void 
absorbers in their sample (based on the NGD definition) if 
the luminosity limit to the closest galaxy is reduced to O.IL*. 
Note however that their sample is very local (z ^ 0.017 or 



< 70 h^Q Mpc away), and it might be biased because of the 
local overdensity to which our Local Group belongs. 

In this work we use a different approach to define 
void absorption systems. We based our definition on cur- 
rent galaxy void catalogs (typical radius of > 14 h^^ Mpc), 
defining void absorbers as those located inside such galaxy 
voids. This leads to larger samples of well identified void 
absorbers compared to previous studies. Moreover, this ap- 
proach allows us to define a sample of absorbers located at 
the very edges of voids, that can be associated with walls, 
filaments and nodes, allowing us to get some insights in the 
distribution of gas in the 'cosmic web' itself. This definition 
is different from the NGD based ones and it focuses on the 
'large scale' (> 10 Mpc) relationship between Ly-Q forest 
systems and galaxies. The results from this work will offer a 
good complement to previous studies based on 'local' scales 
(< 2 Mpc). 

Our paper is structured as follows. The catalogs of both 
Ly-a systems and galaxy voids that we used in this work are 
described in !j2l Definition of our LSS in absorption samples 
and the observational results are presented in We com- 
pare our observational results with a recent cosmological 
hydrodynamical simulation in fjl] We discuss our findings 
in SJSI and summarize them in A check for systematic ef- 
fects and biases that could be present in our data analysis 
is presented in Appendix [X] All distances are in co-moving 
coordinates assuming Ho = 100 h km s'^Mpc'S h = 0.71, 
fim = 0.27, Q,h = 0.73, = cosmology unless otherwise 
st ated. This c o smolo gy was chosen to match the one adopted 
bv !Pan et al.! l|2012! 'l (D. Pann, private communication; see 



2 DATA 

2.1 Gas in absorption 

We use QSO absorption line data from the lDanforth fc Shulll 
(2008, hereafter DS08) catalog, which is the largest high- 
resolution (R = ^ ^ 30 000 - 100 000), low-z IGM sam- 
ple to datqj. Briefiy, the catalog lists 651 Ly-a absorption 
systems at Zabs ^ 0.4, with associated metal lines (O vi, 
N V, C IV, C III, Si IV, Si lll and Fe lll; when the spec- 
tral coverage and signal-to-noise allowed their observation), 
taken from 28 AGN observed with both the Space Tele - 
scope Imaging Spectrograph (STIS, IWoodgate^ et al.! Il998l ) 
on the HST, and the Far Ultraviolet Spectroscopic Explorer 
(^FUSE. [Moos et al]l2000l ). The systems are characterized by 
their rest-frame equivalent widths (Wr), or upper limits on 
Wr, for each individual transition. Column densities (A^'hi) 
and Doppler parameters (6hi) were inf erred usi ng the a|> 
paren t optical depth method (AODM, ISavage fc Sc inb ach! 
I1991! ) and/or Voigt profile line fitting. In particular for the 
Ly-a transition, a curve-of-growth (COG) solution was used 
when other Lyman series lines were available (see also Ap- 
pendix |X2]). We refer the reader to DS08 (and references 
therein) for further description and discussion. 

In order to identify absorbing gas associated with 



^ We note that after this paper was submitted, a new pre-print 
by iTilton et al.l t2012.) appeared with an updated version of the 
DS08 catalog. 
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Table 1. IGM sightlines from DS08 that intersect the SDSS 
survey. 



Sight Line 



RA (J2000) Dec (J2000) ^agn 



PG 0953+414 


09 


56 


52.4 


+41 


15 


22 


0.23410 


14 


Ton 28 


10 


04 


02.5 


+28 


55 


35 


0.32970 


9 


PG 1116+215 


11 


19 


08.6 


+21 


19 


18 


0.17650 


18 


PG 1211+143 


12 


14 


17.7 


+14 


03 


13 


0.08090 


30 


PG 1216+069 


12 


19 


20.9 


+06 


38 


38 


0.33130 


3 


30 273 


12 


29 


06.7 


+02 


03 


09 


0.15834 


35 


Q 1230+0115 


12 


30 


50.0 


+01 


15 


23 


0.11700 


12 


PG 1259+593 


13 


01 


12.9 


+59 


02 


07 


0.47780 


12 


NGC 5548 


14 


17 


59.5 


+25 


08 


12 


0.01718 


13 


Mrk 1383 


14 


29 


06.6 


+01 


17 


06 


0.08647 


16 


PG 1444+407 


14 


46 


45.9 


+40 


35 


06 


0.26730 


10 



^Median HST/STIS signal-to- noise ratio per two-pixel resolution 
element in the 1215 — 1340 A range (C. Danforth, private 
communication). The expected minimum equivalent width, 
Wmini a-t a c.l. of cl corresponding to a given S/N can be 



estimated from W, 



R(s}n) ' '^li*^!'^ ^ spectral 
resolution (e.g., see DS08). 



LSS (drawn from the SDSS DR7), we use a subsample 
of the DS08 AGN sightlines that intersect the SDSS vol- 
ume (PG 0953+414, Ton 28, PG 1116+215, PG 1211+143, 
PG 1216+069, 3C273, Q 1230+0115, PG 1259+593, 
NGC 5548, Mrk 1383 and PG 1444+407; see Table [l]). 
Despite the fact that PG 1216+069 spectrum has a poor 
quality, it is still possible to find strong systems in it, and 
so we decided to do not exclude it from the sample (this 
inclusion does not affect our results; see i l3.1|FI . We use 
the rest of the sightlines in the DS08 catalog to derive the 
general properties of the average absorber for comparison 
(see Appendix lAlfl . 

In our analysis we focus on statistical comparisons of 
the H I properties in different LSS environments. Metal sys- 
tems have smaller redshift coverage and lower number den- 
sities than Ly-a absorbers. Consequently, we do not aim to 
draw statistical conclusions from them. We intend to pursue 
metallicity studies in future work. 



are identified by looking for maximal empty spheres em- 
bedded in the wall galaxy sample. These individual void 
spheres have radii between 10 < iivoid ^ 25 h~^Mpc, with 
mean radius (J?void) ~ 13 fe~^Mpc. The minimum radius of 
10 /i~^Mpc for the void spheres was imposed. Only galax- 
ies with spectroscopic redshifts were used and therefore we 
expect the uncertainties in the void centers and radii to be 
small (< 1 Mpc; we will discuss the effects of peculiar ve- 
locities in H3.ip . Independent void regions are defined by 
combining all the adjacent spheres that share more than 
10% of their volume with another. Void galaxies are defined 
as those galaxies that lie within a void region. We refer the 
reader to P12 for further description and discussion. 

In our analysis, for simplicity, we use the individual 
spheres as separate voids instead of using the different in- 
dependent void regions. This choice has the following ad- 
vantages. First, it allows us to use a perfectly spherical ge- 
ometry, making it possible to characterize each void by just 
one number: its radius. Thus, we can straightforwardly scale 
voids with different sizes for comparison. Secondly, this ap- 
proach allows us to identify regions at the very edges of the 
voids. P12 found that the number density of galaxies has a 
sharp peak at a distance ~ Rvoid from the center of the void 
spheres, a clear signature that walls are well defined (at least 
from the point of view of bright galaxies at low redshifts). 
This is also consiste nt with the predictions of li near grav- 
itation theory (e.g., Ilckd 1 19841 : ISheth fc van de Wevga er3 
2004[) and dark matter simulations (e.g., Benson et al. 120031 : 
Colberg et aLll2005l : ICeccareUi et al.ll2006l . P12). Therefore, 
by looking for absorption systems very close to the edge of 
voids, we expect to trace a different cosmic environment. 
Thirdly, using the individual void spheres securely identi- 
fies void regions. The void-edge sample on the other hand, 
could be contaminated by void regions associated with the 
intersections of two void spheres. We checked that this is 
not the case though (see H3.2|l . This contamination should 
only reduce any possible difference between the two samples 
rather than enhance them. We also note that systematic un- 
certainties produced by assuming voids to be perfect spheres 
should also act to reduce any detected difference. 



3 DATA ANALYSIS AND RESULTS 



2.2 Galaxy voids 

We use a rece ntly released ga laxy-void catalog from SDSS 
DR7 galaxies (|Pan et al.ll2012l . hereafter P12), which is the 
largest galaxy- void sample to date. Hereafter we will use the 
term void to mean galaxy- void unless otherwise stated. P12 
identified > 1000 cos mic voids using the V oidFinder algo- 
rithm described by Hovle fc Vogele-yl (|200 2). with redshifts 
between 0.01 < 2 < 0.102. To summarize, it first uses a near- 
est neighbor algorithm on a volume limited galaxy survey. 
Galaxies whose third nearest neighbor distance is greater 
than 6.3 /i~^Mpc are classified as potential void galaxies, 
whereas the rest are classified as wall galaxies. Void regions 



3.1 Number density of absorption systems around 
voids 

We have cross-matched the IGM absorption line catalog 
from DS08 (see tj2.1|l with the void catalog from P12 (see 
i]2.2|l . A total of 106 Ly-a absorption systems were found in 
the 11 sightlines that intersect the void sample volume (i.e., 
those with 0.01 Zabs ^ 0.102). 

We first look for a possible difference in the number 
density of Ly-a systems as a function of the distance to 
voids. We take two approaches. First, we define X as the 
three-dimensional distance between an absorption system 
and the closest void center in Rvoid units, so 



We note that IChen &: MulchaevI ll2009h have presented a Ly- 
a absorption system list along PG 1216+069 sightline at a better 
sensitivity than that of DS08. In order to have an homogeneous 
sample, we did not include this new data in our analysis however. 



X ; 



; mm — 

sample /tvoid 



(1) 



where s is the co-moving distance between the center of the 
closest void and the absorber. Thus, ^ X < 1 corresponds 
to absorption systems inside voids and X > 1 corresponds 



4 Nicolas Tejos et al. 




Figure 1. Normalized (in area) distribution of H I absorption systems as a function of X (left panel; 0.1 binning) and D (right 
panel; 1 h~^Mpc binning) for both real and random samples. Error bars correspond to the Poissonian uncertainty from the analytical 

approximation ~ y^n~T~| + l and ~ ^Jn — ^ l lGehrelslllQSfit) . Real and random distributions are different at a > 99.5% confidence 
level (see i]3.1l for further details). 



to absorption systems outside voids. A value of X ~ 1 corre- 
sponds to absorption systems around void edges as defined 
by the galaxy distribution. 

The second approach defines D as the three-dimensional 
distance between an absorption system and the closest void 
edge in co-moving /i~^Mpc, so 

D= min (s - i?void) [h'^Mpc]. (2) 

sample 

Negative D values correspond to absorption systems inside 
voids while positive values correspond to absorbers outside 
voids. Values of D « h~^Mpc are associated with absorp- 
tion systems around void edges as defined by the galaxy 
distribution. 

Distances were calculated assuming the absorption sys- 
tems to have no peculiar velocities with respect to the cen- 
ter of the voids. Although this assumption might be realistic 
for gas inside voids, it might not be the case for gas resid- 
ing in denser environments, where gas outflows from galaxies 
might dominate. However, some studies have suggested that 
the bulk of Ly-a forest l ines have little velo c ity offset with 
respe ct to galaxies (e.g., iTheuns et al.ll20o3 : IWilman et al.l 
|2007| ). As an example, a velocity difference of ~ 200 km 

at redshift Zabs ^0.1 would give an apparent distance 
shift of the order of ~ 2 h~^Mpc, which is somewhat higher 
than, but comparable to the systematic error of the void cen- 
ter determination (given that voids regions are not perfectly 
spherical as assumed here) . Note that the uncertainty in the 
void center is smaller than the uncertainty of a single galaxy 
because the void is defined by an average over many galax- 
ies. As previously mentioned, such an uncertainty should not 
artificially create a false signal but rather should dilute any 
real difference. 

Although the X and D coordinates are not indepen- 
dent, we decided to show our results using both. This has 



the advantage of testing the consistency of our results using 
two slightly differently motivated definitions. X is a scaled 
coordinate, good for stacking voids of different radius. It 
is also good for comparisons with some of the P12 results. 
D gives a direct measure of the actual distances involved, 
while still using JJvoid- For convenience, results associated 
with the X definition will be shown normally in the text 
while results associated with the D definition will be shown 
in parenthesis: X (D) format. 

Figure [1] shows the histogram of absorption systems as 
a function of X and D (left and right panel respectively). 
In order to show the effects of the geometry of the survey, 
the random expectations are also shown (shaded distribu- 
tions). To generate the random samples, we placed 1000 
random absorption systems per real one, uniformly between 
ziim < 2abs ^ 0.102 for cach sightline, where zum corresponds 
to the maximum of 0.01 and the minimum observed red- 
shift for a Ly-a in that sightline. In this calculation we have 
masked out spectral regions over a velocity window of ±200 
km s~^ around the position where strong Galactic absorp- 
tion could have been detected (namely: C I, C ll, N v, O I, 
Si II, P III, S I, S II, Fe II and Ni ll) before the random redshifts 
are assigned. A total of 106 000 random absorbers were gen- 
erated. We observe a relative excess of absorption systems 
compared to the random expectation between X ~ 0.9-1.3 
and/or D ~ —2-4 h~^Mpc. Assuming Poisson uncertainty, 
there were 61 ± 8 (65 ± sfl observed, while 38.3 ± 0.2 
(42.4 ± 0.2) were expected from the random distribution. 
This corresponds to an « 30- excess. Similarly, there is a 
significant (~ 3a) deficit of absorption systems ai X > 1.3 

^ Results regarding distances from the center of voids are pre- 
sented in a X (D) format (see i]3.1l l. Reference to this footnote 
will be omitted hereafter. 
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Figure 2. Panels (a) and (b) show the distribution of cokimn densities of H I as a function of X and D respectively. Panels (c) and (d) 
show the distribution of Doppler parameters as a function of X and D respectively. Our LSS samples are shown by different color/symbols: 
void (black circles), void-edge (red squares) and unclassified (blue triangles). Histograms are also shown around the main panels. Vertical 
red dashed lines show the limits of our LSS definitions (see fjS^J. 



and/or -D > 4 h~^Mpc, for which 17±4 (19±4) systems were 
observed compared to the 33.9 ± 0.2 (34.1 ± 0.2) randomly 
expected. We also checked that such an excess and deficit 
did not appear by chance in 1000 realizations, consistent 
with the < 0.1% probabihty of occurrence. No significant 
difference is found for systems at X < 0.9 and/or D < —2 
/i~^Mpc, for which the 28 ± 5 (22 ± 5) found are consistent 
with the random expectation of 33.4 ± 0.2 (29.0 ± 0.2). The 
Kolmogorov-Smirnov (KS) test between the full unbinned 
samples gives a ~ 0.3% (0.5%) probability that both the 
random and the real data come from the same parent dis- 
tribution. We checked that no single sightline dominates the 
signal by removing each individual one and repeating the 
previous calculation. We also checked that masking out the 
spectral regions associated to possible Galactic absorption 
does not have an impact on our results as the same num- 
bers (within the errors) are recovered when these regions 
are not excluded. These results hint at a well defined gas 
structure around voids, possibly analogous to that seen in 



galaxies. The current data are not sufficient to confirm (at a 
high confidence level) the reality of the apparent two-peaked 
shape seen in the real distributions however. 



3.2 Definition of large scale structure in 
absorption 

We define three LSS samples observed in absorption: 

• Void absorbers: those absorption systems with X < 1 
and/or D < /i~^Mpc. A total of 45 (45) void absorbers 
were found. 

• Void-edge absorbers: those absorption systems with 1 ^ 
X < 1.3 and/or < D < 4 h'^Mpc. A total of 44 (42) void- 
edge absorbers were found. 

• Unclassified absorbers: those absorption systems with 
X ^ 1.3 and/or D ^ 4 h'^Mpc. A total of 17 (19) unclassi- 
fied absorbers were found. 
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Table 2. General properties of our LSS samples? 



Sample 


\og{NHi[cra-'^]) 
mean 


median 


hHi [km s 1 
mean 


median 


Void 
Edge 

Unclassified 


13.21 ± 0.67 (13.21 ± 0.67) 
13.50 ± 0.70 (13.52 ± 0.69) 
13.20 ± 0.45 (13.17 ± 0.48) 


13.05 (13.05) 
13.38 (13.38) 
13.36 (13.36) 


28 ± 15 (28 ± 15) 
33 ± 17 (34 ± 17) 
33 ± 11 (31 ± 11) 


25 (25) 
28 (28) 
32 (31) 



"Results are presented \n a. X (D) format (see i|3.1ll . 



To be consistent with the galaxy- void definition, we use 
X = 1 and/or D = /i~^Mpc as the limits between void 
and void-edge absorbers. The division between void-edge 
and unclassified absorbers was chosen to match the tran- 
sition between the overdensity to underdensity of observed 
absorbers compared to the random expectation at X > 1 
and/or D > h~^Mpc (see Figure[T]). 

We have assumed here that the center of galaxy voids 
will roughly correspond to the center of gas voids, however 
that does not necessarily imply that gas voids and galaxy 
voids have the same geometry. In fact, as we do not find a sig- 
nificant underdensity in the number of void absorbers with 
respect to the random expectation, it is not clear that such 
voids are actually present within the Ly-a forest population. 
Of course, the fact that we do not detect this under-density, 
does not imply that the gas voids are not there. A better way 
to look at these definitions is by considering void absorbers 
as those found in galaxy under-densities (galaxy voids) and 
void-edge absorbers as those found in regions with a typ- 
ical density of galaxies. We do not have a clear picture of 
what the unclassified absorbers correspond to. Unclassified 
absorbers are those lying at the largest distances from the 
cataloged voids, but this does not necessarily imply that they 
are associated with the highest density environments only. In 
fact, there could be high density regions also located close to 
void-edges, at the intersection of the cosmic web filaments. 
Given that voids of radius < 10 /i^^Mpc are not present in 
the current catalog it is also likely that some of the unclassi- 
fied absorbers are associated with low density environments. 
Therefore, one interpretation of unclassified-absorbers could 
be as being a mixture of all kind of environments, including 
voids, void-edges and high density regions. 

We checked the robustness of these definitions by look- 
ing at the number of voids and void-edges which can be as- 
sociated with a given absorber. In other words, for a given 
absorption system, we counted how many voids or void- 
edges could have been associated with it by taking simply 
X = s/_Rvoid or D = s — -Rvoid (in contrast to having taken 
the minimum values). Out of the 45 void absorbers, 41 are 
associated with only one void and 4 are associated with 2 
voids, independently of the definition used (either X or D). 
Likewise, out of the 44 (42) void-edge absorbers, 31 (28) are 
associated with just one void-edge, 12 (13) are associated 
with two void-edges and 1 (1) is associated with three void- 
edges. This last system is located at X = 1.04 [D = 0.55 
/i-^Mpc) and has JVm = io" "±o.35 ^^^^-2 ^ 2^+21 

km s~^ at a redshift of Zabs ~ 0.01533. From these values 
the system does not seem to be particularly peculiar. Find- 
ing an association with more than two void-edges is not sur- 
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Figure 3. H I column density distribution for the three different 
LSS defined in this work (see ^3.211 : void absorbers (solid- black 
lines), void-edge absorbers (red-dashed lines) and unclassified ab- 
sorbers (blue-dotted lines). Top panels show the normalized dis- 
tribution using arbitrary binning of 0.5 dex. Bottom panels show 
the cumulative distributions for the unbinned samples. Left and 
right panels correspond to absorbers defined using X and D co- 
ordinates respectively. 



prising as long as the filling factor of voids is not smal0. 
Void absorbers have on average 1.1 ± 0.3 voids associated 
with them, with a median of 1. Void-edge absorbers have in 
average 1.3±0.5 (1.4±0.5) void-edges associated with them, 
with a median of 1(1). These values give a median one-to- 
one association. Therefore, we conclude then that the LSS 
definitions used here are robust. 



3.3 Properties of absorption systems in different 
large scale structure regions 

Figure [2] shows the distribution of column densities and 
Doppler parameters as a function of both X and D. At first 
sight, no correlation is seen between A^'hi or foni and distance 
to the center of voids. Table [2] gives the mean and median 
values of log(Ai'H/[cm^^]) and 6hi for our void, void-edge 
and unclassified absorption systems. These results show con- 
sistency within la between the three LSS samples. 

A closer look at the problem can be taken by investi- 
gating the possible differences in the full Nm and 6hi distri- 
butions of void, void-edge and unclassified absorbers. 



For reference, voids found by P12 have a filling factor of 62%. 
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void / edge 


void/uncl. 


edge/uncl. 


void / not- void 


edge/not-edge 


uncl./not-uncl. 


KS-Prob(logAfHi) 
KS-Prob(bHl) 


2% (0.7%) 

8% (6%) 


74% (66%) 
18% (17%) 


56% (24%) 
71% (75%) 


4% (4%) 
7% (7%) 


3% (0.6%) 
20% (14%) 


64% (54%) 
32% (32%) 



"Results are presented in a X (D) format (see i|3.1ll . 



3.3.1 Column density distributions 

Figure [3] shows the distribution of column density for the 
three different LSS defined above (see H3.2p . The top pan- 
els show the normalized fraction of systems as a function of 
A'hi (arbitrary binning), whilst the bottom panels show the 
cumulative distributions (unbinned). We see from the top 
panels that this distribution seems to peak systematically 
at higher A'^hi from void to void-edge and from void-edge 
to unclassified absorbers. We also observe a suggestion of a 
relative excess of weak systems (A^hi ^ 10^^ cm~'^) in voids 
compared to those found in void-edges. This can also be seen 
directly in Figure [5] (see panels (a) and (b)). The KS test 
gives a probability -P^°fd'Jedge ~ 2% (0.7%) that void and 
void-edge absorbers come from the same parent distribu- 
tion. This implies a > 2ct difference between these samples. 
No significant difference is found between voids or void-edges 
with unclassified absorbers, for which the KS test gives prob- 
abilities of P'°^.^ , « 74% (66%) and P^°/'^, , « 56% 

void/uncl. \ '^J cdgc/uncl. 

(24%) respectively. These results can be understood by look- 
ing at the bottom panels of Figure^ as we see that the max- 
imum difference between the void and void-edge absorbers 
distributions is at A^'hi < lO'^^'* cm~^. On the other hand, 
no big differences are observed at A^hi ^ 10^^'* cm~^. In 
fact, by considering just the systems at A'hi < 10^'^ ** cm~^, 
the significance of the difference between void and void-edge 
absorbers is increased, with P^°^^^j^^^^ ~ 0.9% (0.2%). Like- 
wise, at A^Hi ^ 10^'^ * cm~^, void and void-edge absorber 
distributions agree at the ~ 86% (86%) confidence level. We 
note however that there were ^ 10 systems per sample for 
this last comparison and therefore, it is likely to be strongly 
affected by low number statistics. 

We also investigated possible differences between void, 
void-edge and unclassified absorbers and their complements 
(i.e., all the systems that were not classified as these: not- 
void, not- void-edge, not-unclassified). Not- voids correspond 
to the combination of void-edge and unclassified absorbers 
and so on. The KS gives probabilities of Pi°fd/„ot-void ~ 4% 

(4%), -PedlcTnot-cdgc « 3% (0.6%) implying that void and 
void-edge absorbers are somewhat inconsistent with their 
complements. On the other hand, the distribution of unclas- 
sified absorbers is consistent with the distribution of their 
complements with a KS probability of ^'unc/^not-unci ~ ^4% 
(54%). These results are summarized in Tabled 

3.3.2 Doppler parameter distributions 

Figure[4]shows the distribution of Doppler parameter for the 
three different LSS defined above (see ij3.2p . The top pan- 
els show the normalized fraction of systems as a function 
of &HI (arbitrary binning), whilst bottom panels show the 



0.08 




Figure 4. H I Doppler parameter distribution for the three differ- 
ent LSS defined in this work (see i]3.2ll : void absorbers (black-solid 
lines), void-edge absorbers (red-dashed lines) and unclassified ab- 
sorbers (blue-dotted lines). Top panels show the normalized dis- 
tribution using arbitrary binning of 5 km s~^. Bottom panels 
show the cumulative distributions for the unbinned samples. Left 
and right panels correspond to absorbers defined using X and D 
coordinates respectively. 

cumulative distributions (unbinned). This figure suggests a 
relative excess of 1ow-6hi systems (6hi ^ 20 km s~^) in voids 
compared to those from void-edge and unclassified samples. 
A relative excess of unclassified absorbers compared to that 
of voids or void-edges at high-fcHi values (6hi 35 km s~^) 
is also suggested by the figure. The KS test gives a prob- 
ability -Pvoid/cdgc ~ 8% (6%) that void and void-edge ab- 
sorbers come from the same parent distribution. This im- 
plies no detected difference between void and void-edge ab- 
sorbers. Likewise, no significant difference is found between 
voids or void-edges with unclassified absorbers, for which the 
KS test gives probabilities of Pvoid/und. ~ 18% (17%) and 
^'cdgo/unci. « 71% (75%) respectively. 

As before, we also investigated possible difference be- 
tween LSS and their complements. In this case, neither 
void, void-edge or unclassified absorbers are significantly 
different than their complements with KS probabilities of 

J'v'oid/not-void « 7% (7%), P.'dge/not-edge « 20% (14%) and 

^uncl./not-uncl. ~ 32% (32%). Thcse results are also summa- 
rized in Table [31 

3.4 Check for systematic effects 

Given that the differences between void and void-edge sam- 
ples are still at < 3a of confidence level, we have investigated 
possible biases or systematic effects that could be present in 
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our data analysis. In particular we have investigated (1) pos- 
sible differences in our subsample with respect to the whole 
DS08 sample, (2) the effect of the different characterization 
methods used by DS08 to infer the gas properties, and (3) 
whether uniformity across our redshift range is present in 
our observables. A complete discussion is presented in Ap- 
pendix |X] From that analysis we concluded that no impor- 
tant biases affect our results. 



4 COMPARISON WITH SIMULATIONS 

In this section, we investigate whether current cosmolog- 
ical hydrodynamical simulations can reproduce our ob- 
servational results presented in iJS] For this comparison 
we use the Galax ies-Interqala c tic M edium Interaction Cal- 
culation (gimic, I Grain et al.l |2009| ). Using initial condi- 
tions drawn from the Millennium simulation jSpringel et al.l 
120051 ). GIMIC follows the evolution of baryonic gas within 
five, roughly spherical regions (radius between 18 — 25 h^^ 
Mpc[3) down to 2 = at a resolution of mgas ~ 10^ h^^ 
Mq. The regions were chosen to have densities deviating 
by ("2, — 1, 0, +1, +2)cr from the cosmic mean at z = 1.5, 
where a is the rms mass fluctuation. The +2a region was 
additionally required to be centered on a rich cluster halo. 
Similarly although not imposed, the —2a region is approxi- 
mately centered on a sparse void. The rest of the Millennium 
simulation volume is re-simulated using only the dark mat- 
ter particles at much lower resolution to account for the tidal 
forces. This approach gives GiMiC the advantage of probing 
a wide range of environments and cosmological features with 
a comparatively low computational expense. 

GIMIC includes (i) a recipe for star formation 
designed to enforce a loc al Kennicutt- Schmidt law 
l|Schave fc Dalla Vecchial |2008| ): (ii) stellar evolution and 
the associated delaye d release of 11 chemical elements 
l|Wiersma et al.l|2009bl ): (iii) the contribution of metals to 
the cool ing of gas in the prese nce of an imposed UV back- 
ground jWiersma et al.|[2009al ): and (iv) galactic winds that 
pollute the IGM with metals and can quench sta r formation 
in low-mass halos (|Dalla Vecchia fc Schavell2008l ). Note that 
GIMIC does not include feedback processes associated with 
AGN. For further details about GiMiC we refer the reader to 
ICrain et all (|2009D . 



4.1 Simulated H I absorbers sample 

In order to obtain the properties of the simulated H I ab- 
sorption systems, we placed 1000 parallel sightlines within 
a cube of 20 ft-^Mpc on a side centered in each individ- 
ual GIMIC region ai z = (5000 sightlines in total). We 
have excluded the rest of the volume to avoid any possible 
edge effects. This roughly corresponds to 2.5 sightlines per 
square /i~^Mpc. Given this density, some sightlines could 
be tracing the same local LSS and therefore these are not 
fully independent. We consider this approach to offer a good 



^ Note that GIMIC adopted a, Hq = 100 h km s~lMpc~l,h = 
0.73, nm = 0.25, = 0.75, as = 0.9, fc = cosmology. These 
parameters are slightly different than the ones used in P12. 



compromise of having a large enough number of sightlines 
while not oversampling the limited GiMiC volumes. 

We used the program SPECWlZARi:[f| to generate syn- 
thetic normalized spectra associated to our sightlines using 
the method described by iTheuns et al.l ([l99 8bV specwiz- 
ARD calculates the optical depth as a function of veloc- 
ity along the line-of-sight, which is then converted to flux 
transmission as a function of wavelength for a given transi- 
tion. We only used H I in this calculation. The spectra were 
convolved with an instrumental spread function (Gaussian) 
with FWHM of 6.6 km s^^ to match the resolution of the 
STIS/HST spectrograpld. In order to mimic the continuum 
fitting process in real spectra, we set the continuum level of 
each mock noiseless spectrum at the largest flux value after 
the convolution with the instrumental profile. Given that 
the lines are sparse ai z — 0, there were almost always re- 
gions with no absorption and this last correction was almost 
negligible. 

We used 3 different signal-to-noise ratios in order to 
represent our QSO sample. Out of the total of 1000 per 
GIMIC region, 727 sightlines were modeled with S/N — 9 per 
pixel, 182 with S/N = 23 and 91 S/N = 2. These numbers 
keep the proportion between the different S/N values as it 
is in the observed sample (see last column in Table [T]|f|. 

We fit Voigt profiles to the synthetic spectra automat- 
ically using VPFlljfl, following the algorithm described by 
iGrighton et al.l l|2010l ). First, an initial guess of several ab- 
sorption lines is generated in each spectrum to minimize 
X?oducod- If the x?educed IS greater than a given threshold 
of 1.1, another absorption component is added at the pixel 
of largest deviation and Xrcducod is re-minimized. Absorp- 
tion components are removed if both Nhi< 10^^'^ cm~^ and 
feHi< 0.4 km s~^. This iteration continues until Xroduccd ^ 
1.1. Then, the Voigt fits are stored. We only kept absorption 
lines where the values of log A^hi and 6hi are at least 5 times 
their uncertainties as quoted by vpfit. 

The fraction of hydrogen i n the form of H I within 
GIMIC is obtained from cloudy llFerland et al.l after 
assuming an ionization background from Haardt fc Madaul 
(1996) that yields a photo-ionization rate F = 8.59 10"^* 
s^^. This ionization background is not well constrained at 
z ~ 0, so we use a post processing correction to account 
for this uncertainty. In the optically thin regiine F*^* "" oc i, 
where r is the optical depth (|Gunn fc Petersonlll965l ). Then, 
scaling the optical depth v alues is equivalent to scaling the 
ionization background (e.g.. lTheuns et al .ll998al :[D ave et al.l 
[l99i). First, we combined the five GiMiC regions using dif- 
ferent volume weights namely: (1/12, 1/6, 1/2, 1/6, 1/12) for 
the (—2, —1, 0, 4-1, +2) a regions respectively (see Appendix 
2 m iGrain et aLll2009l . for a justification of these weights). 
Then, we searched for a constant value to scale all the orig- 
inal optical depth values such that the mean fiux of the 
combined sample is equal to the observed mean flux of Ly- 
a absorption at low redshift. A second possibility is to scale 



^ Written by Joop Schaye, Craig M. Booth and Tom Theuns. 
^ Note that the majority of the Ly-a used in this work were 
observed with STIS/HST rather than FUSE. 

* Note that we have divided the mean S/N per two-pixel resolu- 
tion element by ^2 to have an estimation per pixel. 
^ Written by R.F. Carswell and J. K. Webb (see 
http://www.ast.cam.ac.uk/~rfc/vpfit.html). 
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Figure 5. Redshift number density of H I lines as a function of both column density (left panel) and Doppler parameter (right panel) using 
abritrary binning of AlogA^Hi = 0.2 dex and Afejil = 5 km s~^ respectively. Both results have not been corrected for incompleteness. 
Green open-circles correspond to real data from the total sample of DS08 (657 systems) while blue fiUed-circles (slightly offset in the 
x-axes for clarity) correspond to the subsample used in this study (106 systems). The black line corresponds to the volume-weighted 
result from the combination of the five GIMIC regions where the shaded region correspond to the ztlcr uncertainty. Dashed lines show 
the results from each individual GIMIC region. Error bars correspond to the Poissonian uncertainty from the analytical approximation 
0-+ a Jn+l + 1 and a' Jn - | l|Gehrelslll98^ 1. 



the optical depth values in order to match the redshift num- 
ber density of H I lines in some column density range, dN/dz, 
instead of the mean flux. Ideally by matching one observable 
the second would be also matched. 

Extra polati ng th e double power-law fit result from 
iKirkmanetTal] (|2007l ) to 2: = (see their equation 6), the 
observed mean fiux is {F) = 0.987 with a typical statistical 
uncertainty of o-^p^ ~ 0.003. In order to match this number 
in the simulation a scale of 1.16 is required in the origi- 
nal optical depth values (0.86 in V). From this correction, 
the redshift number density of lines in the range 10^^'^ ^ 
A^Hi ^ 10^*^ cm~^ is found to be dN/dz « 50. For reference, 
iLehner et all (120071 ) and DS08 found dN/dz ~ 50 - 90 over 
the same column density range. We have repeated the exper- 
iment for consistency, using = {0.984, 0.990} which are 
within ±lcr(^) of the extrapolated value. To match these, 
scales of 1.50 and 0.84 are required in the original opti- 
cal depths values respectively (0.67 and 1.19 in F). From 
those mean fiuxes we found dN/dz ~ {70,35} respectively 
along the same column density range. Therefore, a value of 
{F) — 0.990 underpredicts the number of H I lines. On the 
other hand, values of {F) = 0.987 and 0.984 are in good 
agreement with observations. In the following analysis we 
use (F) = 0.987 unless otherwise stated. 



4.2 Comparison between simulated and observed 
H I properties 

Figure [S] shows the redshift number density of H I lines 
(not corrected for incompleteness) as a function of both 
column density (left panel) and Doppler parameter (right 



panel). Data from the simulation are shown by the black line 
(volume-weighted result) and each individual GiMiC region 
is shown separately by the dashed lines. For comparison, 
data from observations are also shown. Green open-circles 
correspond to the total sample from DS08 (657 systems) 
while blue fiUed-circles correspond to the subsample used in 
this study (106 systems that intersect the SDSS volume). 
There is not perfect agreement between simulated and real 
data. We see an excess (lack) of systems with A^hi ^ 10^^'^ 
cm~^ (A^Hi 10^'* cm~^) in the simulation compared to ob- 
servations while Doppler parameters are in closer agreement, 
although there is still a difference at low bm- 

Assuming that the column density distribution can be 
modeled as a power-law, the position of the turnover at the 
low TVhi end give us an estimation of the completeness level 
of detection in the sample. As the turnover appears to be 
around A'^ei ~ 10^^ cm~^ in both simulated and real data 
(by design) we do not, in principle, attribute the discrep- 
ancy in the column-density distributions to a wrong choice 
of the simulated S/N. Raising the mean flux to a greater 
value than (F) = 0.987 (less absorption) does not help as 
the dN/dz in the range lO^^'^ ^ JVhi 10^" cm'^ wiU 
then be smaller than the observational result (see ^4.1|) . We 
attempted to get a better match by using a mean flux of 
(F) = 0.984 (more absorption), motivated to produce a bet- 
ter agreement at higher column densities. In order to agree 
at low column densities, we had to degrade the sample S/N 
to be composed of ~ 400, ^ 100 and ~ 500 sightlines at 
signal-to-noise ratios of 9, 23 and 2 respectively. It is im- 
plausible that half of the observed redshift path has such 
poor quality. 

Another possibility to explain the discrepancy could be 
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Figure 6. Column density (left panel) and Doppler parameter (right panel) cumulative distributions for H I. Void absorbers are shown 
by solid-black lines while void-edge absorbers are shown by solid-red lines. Dashed lines show the result from each individual GIMIC region. 
For simplicity we only show LSS definitions based on X. 



the fact that weak systems in observations were preferen- 
tially characterized with the AOD method, whereas here we 
have only used Voigt profile fitting. In order to test this hy- 
pothesis, we have merged closely separated systems (within 
150 km s~^) whose summed column density is less than 
10^^'^ cm~^. Using these constraints, 43 out of 4179 sys- 
tems were merged (~ 1%). Such a small fraction does not 
have an appreciable effect on the discrepancy. As an extreme 
case, we have repeated the experiment merging all systems 
within 300 km s~^ independently of their column densities. 
From this, 555 out of 4179 systems were merged (~ 13%) 
but still it was not enough to fully correct the discrepancy. 
Given that the discrepancy is not explained by a system- 
atic effect from different line characterization methods, we 
chose to keep our original simulated sample in the following 
analysis without merging any systems. 

There is a reported systematic effect by which column 
densities inferred from a single Ly-a line are typically (with 
large scatter) underestimated with respect to the curve-of- 
growth (COG) solution. Similarly bm are ty pically overesti- 
mated (shuU et al.ll2000l : lDanforth et al.|[2006i : see also Ap- 
pendix [X2l for discussion on how this may affect our observa- 
tional results). This effect is only appreciable for A''hi > 10^* 
cm~^ and is bigger for saturated lines. Given that our sim- 
ulated sample was constructed to reproduce the observed 
sample, this effect could be present. If so, it would in princi- 
ple help to reduce the discr epancy at the h i gh col umn den- 
sity end. From figure 3 of iDanforth et al.l l|2006l ) we have 

of, 



inferred a correction for systems with Nui ^ lO^'^'^ cm ^ 



log iVni" 



log N^f - 8.37 



0.62 



(3) 



vjji and A^Hi'^ ^'^^ the corrected and observed 
A'hi values respectively. From this correction we found an 
increase in the number of systems at A'hi ^ 10^*'^ cm~^ up 
to values consistent with observations. This however does 
not help with the discrepancy at lower column densities. 
At this point, it is difficult to reconcile the simulation 



result with the real data using only a single effect. We note 
that the discrepancy is a factor of ~ 2 only, so it could be 
in principle explained through a combination of several ob- 
servational effects. Also note that the number of observed 
lines at higher column densities is still small and it could be 
affected by low number statistics. The lack of systems with 
very low A'^ei and bni values can be explained by our selec- 
tion of the highest signal-to-noise being SjN — 23 while in 
real data there could be regions with higher values. It is not 
the aim of this section to have a perfect match between sim- 
ulations and observations but rather examine the qualitative 
differences between simulated regions of different densities. 
Thus hereafter, we will use the results from the simulation 
in its original form (as shown in Figure [5}, i.e., without any 
of the aforementioned corrections. 



4.3 Simulated H I absorbers properties in 
different LSS regions 

Given that GiMiC does not provide enough volume to per- 
form a completely analogous search for voids (each region is 
~ 20 /i~^Mpc of radius), we use them only as crude guides to 
compare our results with. We could consider the —2a region 
as representative of void regions as it is actually centered 
in one. Naively, we could consider the Oct regions as repre- 
sentative of void-edge regions, as it is there where the mean 
cosmological density is reached. A direct association for the 
-|-1ct and -|-2ct is not so simple though, as they would be asso- 
ciated to some portions of the void-edge regions too. It seems 
more reasonable to use the GiMiC spheres as representative 
of different density environments then, where —2a/+2a cor- 
respond to extremely under/over-dense regions and so on. 
For reference, the whole (—2, —1, 0, +1, +2)ct GIMIC regions 



LSS in absorption: Gas within and around Voids 11 



Table 4. Kolmogorov-Smirnov (KS) test probabilities be- 
tween different GIMIC regions? 



A^HI 


-2a 


-la 


Oct 


+ 1<J 


+2a 


-2(7 




30% 


2% 


2% 


« 1% 


-la 


77% 




6% 


< 1% 


< 1% 


Oct 


11% 


0.9% 




0.7% 


< 1% 


+1ct 


< 1% 


< 1% 


0.4% 




« 1% 


+ 2(7 


< 1% 


« 1% 


« 1% 


< 1% 





^KS test probabilities for log A^hi and bui distributions are shown 
in the left-bottom and right-upper sides respectively. <C 1% 
corresponds to values < 10~^%. 



correspond to densities of 



(0.4,0.6,0.9, 1.2, l.Snl at 

z = respectively, where ( p) is the me an density of the 
universe (see Figure Al from lCrain et al] |200i?!V 

Figure [6] shows the cumulative distributions of A^hi (left 
panel) and 6hi (right panel). Results from each of the indi- 
vidual GIMIC region are shown by dashed lines. Void and 
void-edge absorbers are shown by solid black and red lines 
respectively. For simplicity we show only LSS definition 
based on X. Cumulative distributions between real and sim- 
ulated data do not agree perfectly. However, in both real 
and simulated data, there is an offset between column den- 
sities and Doppler parameters found in different environ- 
ments. Low density environments have smaller values for 
both Ah: and 6hi than higher density ones (and viceversa). 
This trend still holds when using a S/N — 23 per pixel for 
the 5000 sightlines. 

The KS test gives a significant difference between the 
+2a, +la and +0a regions, and any other GiMiC region at 
the > 99%, > 99% and > 95% confidence level respectively 
in both Ahi and bm distributions. The KS test gives no 
significant difference between the —2a and —la regions in 
both Ahi and bm distributions (see Table 14)1. These results 
do not change significantly when correcting GiMiC to match 
the observed Ahi distribution using a different {F} and S/N 
values. We do not attempt to make a more detailed com- 
parison between distributions coming from real data (void, 
void-edge samples) and the different GiMiC regions as there 
are already known differences between them (see i)4.2|l . 



5 DISCUSSION 

5.1 Three Ly-a forest populations 

Our first result is that there is a > 99% c.l. excess of Ly- 
Q systems at the edges of galaxy voids compared to a random 
distribution (see ij3.1l and Figure[T}. Our random sample was 
normalized to have the same density of systems in the whole 
volume. Then, an excess in a sub-volume means necessarily 
a deficit in another. Given that we found no significant dif- 
ference in the number of systems in voids with respect to the 

Given that we are using cubic sub-volumes centered in these 
spheres, these cubes should have higher density differences be- 
tween them. 



random expectation, the excess is not explained by a deficit 
of Ly-a systems inside galaxy voids. The observed excess 
seems more related to the lack of systems found at distances 
> 5 /i~^Mpc outside the cataloged voids (and viceversa). 
Thus, despite the fact that we see Ly-a clustered at the 
edges of galaxy voids, it is not cle ar from this data that Ly- 
a- voids at low-z exist at all (see ICarswell fc Reej 1987 for 
similar result at high-z; although see IWilliger et al. I I2OOOI ). 
This picture is somewhat different from the case of galax- 
ies, where galaxy voids a re present even in the distribution 
of low mass galaxies fe.g.. |Peeblell200ll : lTikhonov fc Klvpiiil 
120091 ). There is agreement though in the sense that both Ly- 
Q system and galaxy distributions have their peaks at the 
edges of galaxy voids. We observe a typical scale length of 
the excess to be ~ 5 /i~^Mpc, consistent with numerical pre- 
dictions for the typical radius of the filamentary structure of 
the 'cosmic web' (^ 2 fe~'^Mpc; seeiG onzalcz & Padilla 201^; 
lAragon-Calvo et al.ll201ol : iBond et al.ll2010l ). Note that this 
scale length is approximately twice the scale associated with 
a velocity uncertainty of Aw ~ 200 km s~^ at z — 0. Such 
dispersions could be present in our void-edge sample. 

In our data analysis we have defined three samples of 
absorption systems, based on how they are located with re- 
spect to the closest galaxy void (see H3.2p . Let us consider 
now a very simple model in which we have only two LSS en- 
vironments: under-dense and over-dense LSS. Then we could 
relate all the 'random-like' Ly-a forest systems found in the 
void sample {X < 0.9 and/or D < -2 h'^Mpc) with the 
under-dense LSS, while all the systems associated with the 
excess over random (0.9 ^ A < 1.3 and/or —2 ^ D < 4 
h~^Mpc) to the over-dense LSd"]. Note that the fact that 
we cannot distinguish between the under-dense LSS Ly- 
a distribution and a random distribution does not mean 
that the former is really random. If these Ly-a forest sys- 
tems follow the u nderlying dark matter distribution (e.g., see 
ICroft et al.ll 19981 ). they should have a non-negligible cluster- 
ing amplitude that is not observed only because of the lack 
of statistical power of our sample. In relative terms, con- 
sidering the under-dense LSS Ly-a as random seems to be 
a good approximation though, especially for the large scale 
distances involved in this work (> 1 Mpc). This is also sup- 
ported by the very low auto-correlation amplitude observed 
in th e whole population of Ly-a forest systems at such scales 
(e^ lCroft et al.lll998l : iRollinde et aLll2003l : ICrighton etHI 
[20111 ). This 'random' behavior of Ly-a systems in the under- 
dense LSS can be understood as originating in structures 
still evolving from the primordial density perturbations in 
the linear regime. At z = however, the majority of the mass 
resides at the edges of voids (in the 'cosmic web') whose den- 
sity perturbations have reached non-linear evolution regime 
at higher redshifts. For reference, we expect the under-dense 
and over-dense LSS to have typical 5 >0 and 5 <0 respec- 
tively, where S is the density contrast defined as, 

c _ p-(p) 

° - (p) ' 



(4) 



where p is the density and (p) is the mean density of the 
universe. Note however that these LSS environments are not 
defined by a particular density but rather by a topology 
(voids, walls, filaments). 



We have left the unclassified systems out of this interpretation. 
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Theoretical arguments point out that the observed col- 
umn density of neutral hydrogen at a fixed z is, 



AT ^, „l-5 T^-0.26 T^- 

A'hi CC 1 I 



1 rO.5 

J a 



(5) 



where is the density of hydrogen, T is the temperature 
of the gas, F is the hydrogen photo-i onization rate and fg 
is the fraction of mass in gas (jSchave 2001). In the diffuse 
IGM it has been pre dicted that T oc pg, where a ~ 0.59 
l|Hui fc Gnedinlll997l '). This implies that for a fixed T, the 
main dependence of A'hi is due to pu as A^hi oc pJi^- Then, 
despite the extremely low densities inside galaxy voids we 
can still observe Ly-a systems, although only the ones cor- 
responding to the densest structures. 

Let us consider the predicted ratio between A'hi ob- 
served inside voids and at the edges of voids as. 



-'*HI 



„void 

Ph 

cdec 



J^cdgc 



■void 
9 



rcdgc 
.13 



(6) 



Given that the timescale for photons to travel along ~ 10 — 
20 ft-^Mpc is < 1 Gyr, we can consider T™'"* ^ r'^'^^o 
Even if we assume that the gas inside voids has not formed 
galaxies, /™"* > fg'^^'^, because fg is dominated by the dark 
matter. This implies that a given observed A'hi inside and at 
the edge of galaxy voids will correspond to similar densities 
of hydrogen (ph°"^ ~ Ph'^'')- This is important because it 
means that the Ly-a forest in the under-dense LSS is not 
different than the over-dense LSS one, and two systems with 
equal A^hi are comparable, independently of its large scale 
environment. 

If there were no galaxies, this simple model may suffice 
to explain the differences in the observed Ly-a population. 
The fact that some of the Ly-a systems are directly 
associated with gala^des cannot be neglected though. There 
is strong evidence from observations (e . g., Lanzetta et al.l 



2ooe 
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and simulations (e.g. 

I2OIII ) that A'hi ^ 10^^ cm~^ systems are preferentially 
found within a couple of hundred kpc of galaxies. Probably 
an appropriate interpretation of such a result is that galaxies 
are always found in 'local' (< 100 h~^kpc) high A^hi density 
regions. Then, a plausible scenario would require at least 
three types of Ly-a forest systems: (1) containing embedded 
galaxies, (2) associated with over-dense LSS but with no 
close galaxy and (3) associated with under-dense LSS but 
with no close galaxy. For convenience, we will refer to the 
first type as 'halo-like', although with the caution that 
these systems may not be gravitationally bound with the 
galaxy. 

Given that there are galaxies inside galaxy voids, the 
'halo-like' Ly-Q systems will be present in both low and 
high density LSS environments (galaxies are a 'local' phe- 
nomenon). The contribution of the 'halo-like' in galaxy voids 
could be considered small though. Assuming this contribu- 
tion to be negligible, we can estimate the fraction of Ly- 
Q systems in the under-dense LSS as ~ 25 — 30% ± 69^^1. 

12 Tliese numbers come from 28 ± 5(22 ± 5) and 61 ± 8(65 ± 8) 
systems found at X < 0.9 (D < -2 /i-^Mpc) and at 0.9 ^ X < 
1.3 (-2 s£ D < 4 h-lMpc) respectively (see j3ll . 



Likewise, ~ 70 — 75% ± 12%^^ of the Ly-a forest popula- 
tion are due to a combination of systems associated with 
galaxies and systems associated with the over-dense LSS. 
We could estimate the contribution of 'halo-like' absorbers 
by directly looking for and counting galaxies relatively close 
to the absorption systems. A rough estimation can be done 
by assuming that galaxy halos will have only A'hi ^ 10^* 
cm~^ systems, leading to a contribution of ~ 12— 15%=b4/^F^ 
in our sample. 

In summary, our results require at least three types of 
Ly-a systems to explain the observed Ly-a forest population 
at \ow-z (A'hi > lO^^ '^ cm'^): 

• Halo-like: Ly-Q with embedded nearby galaxies (< 100 
h'^kpc) and so directly correlated with galaxies (~ 12 — 
15% ±4%), 

• Over-dense LSS: Ly-a associated with the over-dense 
LSS that are correlated with galaxies only because both pop- 
ulations lie in the same LSS regions (« 50 — 55% ±13%) and, 

• Under-dense LSS: Ly-Q associated with the under- 
dense LSS with very low auto-correlation amplitude that 
are not correlated with galaxies (~ 25 — 30% ± 6%). 

The relative contributions of these different Ly-a popula- 
tions is a function of the lower A'hi limit. Low A'hi systems 
dominate the Ly-Q column density distribution. Then, given 
that under-dense LSS Ly-a systems tend to be of lower col- 
umn density than the other two types, we expect the contri- 
bution of 'random-like' Ly-a to increase (decrease) while ob- 
serving at lower (higher) A'hi limits. Note that there are not 
sharp A'hi limits to differentiate between our three popula- 
tions (see Figure [2|. The 'halo- like' is defined by being close 
to galaxies while the 'LSS-like' ones are defined in terms of 
a LSS topology (voids, wall, filaments). 

Motivated by a recently pu blished study on the Ly- 
a/galaxy association bv Prochas ka et al.l (|201ll ). we can set 
a conservative upper limit to the 'halo-like' contribution. 
These authors have found that nearly all their observed 
L ^ O.OIL* galaxies (33/37) have A'hi > 10^^'^ cm~^ ab- 
sorption at impact parameters < 300 /ifj^kpc. If we invert 
the reasoning and assume an extreme (likely unrealistic) sce- 
nario where aZZthe A^hi ^ 10^^'^ cm~^ are directly associated 
with galaxies, then, the 'halo-like' contribution will have an 
upper limit of < 33% ± 79^F^. Consequently, the contribu- 
tion of the over-dense LSS to the Ly-a population will be 
> 37 — 42%±14%. Still, note that we have found several sys- 
tems with lO"'^ < A'hi < 10^*'^ cm"^ inside galaxy voids 
for which a direct association with galaxies is dubious (see 
Figure [Hi. Also note that only ~ 10% (~ 0%) of the Ly- 
Q systems between 10^^'^ < A'hi < 10^"'* cm"^ may be 
associated with a galaxy at impact para meters < 300 kpc 
(< 100 kpc) in the lProchaska et al.l (|201ll ) sample (see their 
figure 4). 

Our findings are consistent with previous studies point- 
ing out a non-negligible contribution of 'rand om' Ly-a sys- 
tems (at a similar TVhi limit) of fa 20 - 30% ||Mo fc Morrid 

From either 13/89 (excluding the unclassified sample) or 
13/106 (including the unclassified sample). We have assumed 
Poisson uncertainty. 

From either 29/89 (excluding the unclassified sample) or 
35/106 systems in our sample (including the unclassified sample). 
We have assumed Poisson uncertainty. 
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ll994l : IStocke et al.ll 19951 : iPenton et al.ll2002l '). These authors 
estimated that « 70 — 80% of the Ly-Q population is asso- 
ciat ed with either LSS ( galaxy filaments) or galaxies. Note 
that lMo fc Morris! (| 19941 ') put a upper limit of f» 20% being 
directly associated with galaxies, which is also consistent 
with our estimation. Our result is also in accordance with 
the previous estimation that 22% ± 
based on 8 systems) and 17% ±4% ( 



8% (Penton et al. 


2002 


(Wakker & Savage 


200£ 



based on 17 systems) of the Ly-a systems lie in voids (de- 
fined as locations at > 3 h^g Mpc from the closest > L* 
galaxy). This is in contrast with early mo dels that asso- 
ciated all Ly-Q system s with galaxies (e.g., iLanzetta et al.l 
ll995l : IChen et ahll 19981 ). 

Although there is general agreement with recently pro- 
posed mo dels to explain the origin of the low-z Ly-a for - 
est (e.g., IWakker fc Savagd l2009l : iProchaska et alj l201lD . 
we emphasize that our interpretation is qualitatively dif- 
ferent and adds an important component to the picture: 
the presence of the under-dense LSS ('random-like') sys- 
tems. For instance, assuming infinite fila ments of typical 
widths of ~ 400 /i^j^kpc around galaxies, IProchaska et al.l 
l|201ll ) argued that all Ly-a systems at low-z belong ei- 
ther to the circum-galactic medium 

(CgMEI; 

which includes 

our 'galaxy halo' definition) or the filamentary structure in 
which galaxies reside (equivalent to our over-dense LSS def- 
inition). Our findings are not fully consistent with this hy- 
pothesis, as neither the 'CGM model' nor the 'galaxy fil- 
ament model' seem likely to explain the majority of our 
under-dense LSS absorbers at iVni < 10^^'^ cm~^. To do 
so there would need to be a whole population of unob- 
served galaxies (dwarf spheroidals?) inside galaxy voids with 
an auto-correlation amplit ude as low as the 'random -like' 
Ly-Qf one. As discussed by iTikhonov fc KlvpinI |2009| . very 
low surface brightness dwarf spheroidals could be a more 
likely explanation than dwarf irregulars because the latter 
should have been observed with higher incidences in re- 
cent H I emission blind su rveys inside galaxy voids (e.g., 
HIPASS. IPovle et al.llioosl 'l. On the other hand, the forma- 
tion of dwarf spheroidals inside galaxy voids is difficult to be 
explained from the curren t galaxy formation paradigm (see 
ITikhonov fc Klvpinll200^ for further discussion). As men- 
tioned, it seems more natural to relate the majority of the 
under-dense LSS absorbers with the peaks of extremely low 
density structures inside galaxy voids, still evolving linearly 
from the primordial density perturbations that have not 
formed yet galaxies because of their low densities. Our in- 
terpretation can be tested by searching for galaxies close 
to our lowest A''hi void-absorbers (see Figure (21). Another 
prediction of our interpretation is that the vast majority of 
A'^Hi 10^^ cm~^ systems should reside inside galaxy voids. 
If the QSO sightlines used here were observed at higher sen- 
sitivities, weak Ly-Q systems should preferentially appear 



According to the IProchaska et al. 1 1I2OIII ') definition, the CGM 
corresponds to highly ionized medium around galaxies at dis- 
tances greater than the virial radius but smaller than ~ 300 kpc, 
that need not be causally connected (associated, gravitationally 
bound) with these galaxies. We do not see a clear advantage of 
adopting this terminology and so wc use 'IGM' instead to re- 
fer to the same medium. We only make a distinction between 
LSS (voids, walls, filaments) and the ones with embedded nearby 
galaxies (< 100 /i~^kpc). 



at X < 1 (D < /i"^Mpc). Therefore, we should expect 
to have an anii-correlation between A'^ei ^ 10^"^ cm~^ and 
galaxies. 

5.2 A^Hi and bm distributions 

Our second result is that there is a systematic difference 
(> 98% c.l.) between the column density distributions of 
Ly-Q systems found within, and those found at the edge of, 
galaxy voids. Void absorbers have more low column density 
systems than the void-edge sample (see Figure [3]) . A simi- 
lar trend is found in GIMIC, where low density environments 
present smaller A''hi values than higher density ones (see 
Figure [6l left panel). This can be explained by the fact that 
baryonic matter follows the underlying dark matter distri- 
bution. Then, the highest density environments should be 
located at the edges of voids (in the intersection of walls 
and filaments), consequently producing highe r column den - 
sity absorption than in galaxy voids (e.g., see ISchav3l200ll ) . 

Also, by construction, there is a higher chance to find 
galaxies at the edges rather than inside galaxy voids. As- 
suming that some of the Ly-Q forest are associated with 
galaxy halos (see ^S.ll for further discussion), then this pop- 
ulation should be present mainly in our void-edge sample. 
As galaxy halos correspond to local density peaks, we should 
also expect on average higher column density systems in 
this population. Given that galaxies may affect the proper- 
ties of the surrounding gas, there could be processes that 
only affect Ly-Q systems close to galaxies. For instance, the 
distribution of Ly-Q systems around galaxy voids seems to 
show a two-peaked shape (see Figure [T|) . We speculate that 
this could be a signature of neutral hydrogen being ionized 
by t he ultra-violet backgro und produced by galaxies (see 
also lAdelberger et al.ll2003l ). mostly affecting iVni < lO" 
cm~^ inside the filamentary structure of the 'cosmic web'. 
Another explanation could be that in the inner parts of the 
filamentary structure, Ly-Q systems get shock heated by the 
large gravitational po tentials, raising their t emperature and 
ionization state (e.g.. [Cen fc Ostrikeij[l999l ). A third possi- 
bility is that it could be a signature of bulk outfiows as the 
shift between peaks is consistent with a Av « 200 — 300 
km s~^. On the other hand, the two peaks could have dis- 
tinct origins as the first one may be related to an excess 
of A'^Hi ^ 10^^ cm~^ systems, probably associated with the 
over-dense LSS in which galaxies reside, while the second 
one may be related to an excess of A'^ei ^ 10^* cm~^ sys- 
tems, more likely associated with systems having embedded 
galaxies. As mentioned, we cannot prove the reality of this 
two-peaked signature at a high confidence level from the 
current sample and so we leave to future studies the confir- 
mation or disproof of these hypotheses. 

The GIMIC data analysis shows a clear differentiation of 
6hi distributions in different density environments (see Fig- 
ure [S] right panel). Low density environments have smaller 
&HI values than higher density ones. We see a similar trend in 
the real data between our void and void-edge absorber sam- 
ples, although only at a > 90% of confidence level (i.e., not 
very significant; see Figure |4|. The main mechanisms that 
contribute to the observed line broadening are temperature, 
local turbulence and bulk motions of the gas (excluding sys- 
tematic effects from the line fitting process or degeneracy 
with A^Hi for saturated lines). Naturally, in high density en- 
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vironments, we would expect to have greater contributions 
from both local turbulence and bulk motions compared to 
low density ones. The gas temperature is also expected to 
increase from low density environments to high density ones. 
As previously mentioned, theoretical arguments predict the 
majority of the diffuse IGM will have temperatures related 
to the density by T oc with a > llHui fc GnedinI 1 19971 : 
iTheuns et all Il998bl : ISchave et all Il999^ . This is also seen 
in density-temperature diagrams drawn fr om current hydro- 
dynamical cosmological si mulations (e.g., iDave et al.ll201ol : 
iTepper-Garci'a et al.llioi^ '). Therefore, our findings are con- 
sistent with current expectations. 



5.3 Future work 

The high sensitivity of the recently installed Cosmic Origins 
Spectrograph (COS/HST, Green et al. 20lj) in the UV (es- 
pecially the far-UV), will allow us to improve the A'hi com- 
pleteness limit compared with current surveys. This will con- 
siderably increase the number of observed Ly-a absorption 
systems at low-z. In the short term, there are several new 
QSO sightlines scheduled for observations (or already ob- 
served) with COS/HST that intersect the SDSS volume. 
Combining these with current and future galaxy void cata- 
logs, we expect to increase the statistical significance of the 
results presented in this work. COS/HST will also allow ob- 
servations of considerably more metal lines (especially O vi) 
than current IGM surveys. Again, in combination with LSS 
surveys, this will be very useful for studies on metal en- 
richment in different environments. For instance, we have 
identified 8 systems with observed O VI absorption from 
STIS/HST in our sample. Three of these lie inside voids 
at X f» {0.6,0.7,0.9} (D ^ {-5.4,-4.6,-1.6} /I'^Mpc) 
respectively. The first two systems that lie inside voids cor- 
respond to the highest A'hi values (Ahi> 10^*'^ cm~'^; see 
Figure [2]). We have performed a search in the SDSS DR8 for 
galaxies in a cylinder of radius 1 h^i Mpc and within ±200 
km s~^ around these 2 absorbers (both systems belong to 
the same sightline and are at a similar redshift; one of them 
shows C IV absorption also). We found 9 galaxies with these 
constraints, hinting on a possible association of these sys- 
tems with a void galaxy. The one at the very edge of the void 
limit has Ah: = 10i3 "±0 07 ^^^^-2 ^^^^ ^ ^^gis.egio.is 

cm~^, and it could in principle be associated with the over- 
dense LSS. The other 5 O vi absorbers lie in our void-edge 
sample and have Ahi > 10^^'^ cm~^, so they are likely to 
be associated with galaxies. None of the observed O vi lie 
in our unclassified sample. The current sample of O vi sys- 
tems is very small, and so we do not aim to draw statistical 
conclusions from them. However, these systems individually 
offer interesting cases worth further investigation. We in- 
tend to perform a carefully search for galaxies that could be 
associated to each of the Ly-a absorbers presented in our 
sample in future work. In the longer term, it will be possible 
to extend similar analysis to well defined gala:xy filaments 
and clusters when the new generation of galaxy surveys are 
released. 

A scenario with three different types of Ly-a forest sys- 
tems, as proposed here, can help to interpret recent measure- 
ments of the cros s -correlation between Ly-a and galajcies 
dChen et al.l l2005l: iRvari-Webej l2006l: IWilman eFall l2007l: 
IChen fc MulchaevI l2009l : IShone et all I2OI0I : iRudie et all 



l2012h . These studies come mainly from pencil beam galaxy 
surveys around QSO sightlines where identifying LSS such as 
voids or filaments is more challenging. As mentioned, differ- 
ent Ly-a systems are not separated by well defined Ahi lim- 
its and so we suggest using our results to properly account 
for under-dense LSS ('random-like') absorbers in gas/galaxy 
cross-correlations. Truly random distributions are easy to 
correct for, as they lower the amplitude of the correlations 
at all scales. Then, acknowledging these 'random-like' ab- 
sorbers, it will be possible to split the correlation power in 
its other two main components: gas in galaxy halos and gas 
in the over-dense LSS. Our group is currently working in 
a future paper to study the gas/galaxy cross-correlation, in 
which these corrections will be taken into account. 



6 SUMMARY 

We have presented a statistical study of H I Ly-a absorption 
systems found within and around galaxy voids at z < 0.1. 
We found a significant excess (> 99% c.l.) of Ly-a systems 
at the edges of galaxy voids with respect to a random dis- 
tribution, over a ~ 5 h~^Mpc scale. We have interpreted 
this excess as being due to Ly-a systems associated with 
both galaxies ('halo-like') and the over-dense LSS in where 
galaxies reside (the observed 'cosmic web'), accounting for 
^ 70 - 75% ± 12% of the Ly-a population. We found no 
significant difference in the number of systems inside galaxy 
voids compared to the random expectation. We therefore in- 
fer the presence of a third type of Ly-a systems associated to 
the under-dense LSS with a low auto-correlation amplitude 
(~ random) that are not associated with luminous galaxies. 
These 'random-like' absorbers are mainly found in galaxy 
voids. We argue that these systems can be associated with 
structures still growing linearly from the primordial den- 
sity fluctuations at 2: = that have not yet formed galaxies 
because of their low densities. Although the presence of a 
'random' population of Ly-a absorbers was also inferred (or 
assumed) in previous studies, our work presents for the first 
time a simple model to explain it (see i]5.ip . Above a limit 
of Ahi > 10^^-^ cm'^ we estimate that ^ 25 - 30% ± 6% 
of Ly-a forest systems are 'random-like' and not correlated 
with luminous galaxies. Assuming that only Ahi ^ 10^^ 
cm~'^ systems have embedded galaxies nearby, we have esti- 
mated the contribution of the 'halo-like' Ly-a population to 
be 12 - 15% ± 4% and consequently ^ 50- 55% ± 13% of 
the Ly-a systems to be associated with the over-dense LSS. 

We have reported differences between both the column 
density (Ahi) and the Doppler parameter (&hi) distributions 
of Ly-a systems found inside and at the edge of galaxy 
voids observed at the > 98% and > 90% of confidence 
level respectively. Low density environments (voids) have 
smaller values for both Ahi and 6hi than higher density ones 
(edges of voids). These trends are theoretically expected. We 
have performed a similar analysis using simulated data from 
GIMIC, a state-of-the-art hydrodynamical cosmological sim- 
ulation. Although GIMIC did not give a perfect match to the 
observed column density distribution, the aforementioned 
trends were also seen. Any discrepancy between GiMiC and 
real data could be due to low number statistic fluctuations 
and/or a combination of several observational effects. 

In summary, our results are consistent with the expec- 
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tation that the mechanisms shaping the properties of the 
Ly-a forest are different in different LSS environments. By 
focusing on a 'large scale' (> 10 Mpc) point of view, our 
results offer a good complement to previous studies on the 
IGM/galaxy connection based on 'local' scales (< 2 Mpc). 
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APPENDIX A: CHECK FOR SYSTEMATIC 
EFFECTS 

In this section we investigate possible biases or systematic 
effects that could be present in the data analysis. 

Al Comparison between our subsample and the 
whole DS08 sample 

In this section wc explore whether the subsample of H I sys- 
tems used here is statistically different from the rest of the 
H I population found in the other sightlines of the DS08 
catalog. For this, wc compared the column density (A'hi) 
and Doppler parameter (6hi) distributions between the 106 
absorption systems found inside the void catalog volume 
(0.01 ^ 2abs ^ 0.102 in 11 sightlines) with those 545 out- 
side this volume. The KS test gives a probability of « 5% 
and ~ 90% that both A^hi and bm distributions, inside and 
outside the void catalog volume, are drawn from the same 
parent distribution respectively. This shows there is no sig- 
nificant difference in the 6hi distribution and an 2a dif- 
ference between the A^hi distributions. 

One explanation for such a difference could be due to an 
intrinsic evolution of the Ly-a forest between Zabs ^ 0.102 
and 0.102 < 2;abs < 0.4. To check this, we performed a KS 
comparison between the 286 absorbers at 2:abs < 0.102 (re- 
gardless of whether they arc inside the void catalog volume 
or not) with the 365 systems at higher redshifts, to look for 
a possible difference in both A^hi and 6hi distributions. The 
distribution of 6hi does not show any difference (KS Prob. 
~ 87%). On the other hand, the KS probability for the two 
A'hi distributions is ~ 8%, hinting that such evolution could 
be present. We note, however, that an observational bias be- 
tween low and high-« systems (e.g., due to different selection 
functions) could also explain the observed difference. We did 
not further explore this matter. 

Finally, we repeated the previous comparisons between 
the 106 systems inside the void catalog volume with those 
155 outside it having 0.01 < Zabs < 0.102. The KS test 
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Figure Al. H I column density versus H I Doppler parameter for 
systems in our sample. Different characterization methods (see 
i]A2l l are shown by different colors/symbols. Red circles corre- 
spond to systems measured by Voigt profile fits. Blue squares 
correspond to systems measured by the AOD method. Green tri- 
angles correspond to systems measured by a COG solution. His- 
tograms (arbitrary binning) are also shown around the main panel 
using the previous color definition. 



gives this time a probability of ~ 23% and « 81% of both 
A^Hi and feni inside and outside the void catalog volume are 
drawn from the same parent distribution, respectively. No 
significant (> 2a) differences are found for these samples. 
Thus, we conclude that the properties of the systems in the 
sightlines used in this work are not statistically different 
from the properties of the systems in other sightlines when 
we restrict ourselves to the same redshift range. 

We note that for our comparisons we use systems with 
A^'hi below the completeness of the column density distri- 
bution itself (A^'hi < 10^^ '* cm~^). The classification of ab- 
sorber environment does not depend on column density but 
rather corresponds to a geometrical association of absorbers 
with known galaxy voids. Therefore, this should not affect 
any of our results. We also checked that none of the AGN 
used here were observed in particular for having sightlines 
intersecting a known void region. 



without distinction between these different methods we will 
explore any possible bias that this could produce. 

Figure lXTI shows H I column densities versus H I Doppler 
parameters for the 106 systems in our sample, where the 
different characterization methods used by DS08 are shown 
in different colors/symbols. It is possible to observe a clear 
separation between column densities derived from Ly-a-only 
methods and COG solutions. This difference is a direct con- 
sequence of the fact that COG solutions can only be ob- 
tained for systems showing a high order Lyman series line, 
whose equivalent widths are smaller than Ly-a for the same 
A^'hi and bm values (because they have smaller wavelengths 
and smaller oscillator strengths) . This results in a shift of the 
completeness level to higher column densities for the COG 
solutions, as can be seen in Figure ET] On the other hand, 
no significant difference seems to be present for bm values 
between different characterization methods, which has been 
confirmed by a KS test. 

Our definitions of the different LSS samples do not de- 
pend on either A^hi nor bui but it rather correspond to a 
geometrical selection of how far away a system lies from the 
center of a void. Thus, the aforementioned difference ob- 
served in Nui should not affect the conclusions drawn from 
this study as long as our different LSS samples have simi- 
lar contributions of systems measured with one method or 
the other. Assuming a Poisson distribution for the number 
of systems in each sample, there is a relative contribution 
of 82% ± 18% (82% ± 18%), 73% ± 17% (74% ± 17%) and 
82% ± 30% (79% ± 27%) for systems measured by either 
Voigt fit or AOD methods in the void, void-edge and un- 
classified samples respectively. These numbers are all con- 
sistent with each other within la. Therefore we conclude 
that different characterization methods do not introduce an 
important bias in our analysis. 

There is also a reported systematic effect seen when us- 
ing Ly-a only systems by which A^hi estimates are typically 
(with large scatter) low by a factor of ~ 3 while bm are 
high by a factor of ~ 2 (jShuU et all |2000| : (DanfortheFai] 
l2006h . This systematic effect is important for higher col- 
umn densities. Our sample is dominated by systems with 
Ahi< lO^'' cm~^, for which the effect is s maller than the 
quote d numbers (see figures 2 and 3 from iDanforth et al.l 
I2OO6I ). So, despite the fact that our sample is dominated 
by Ly-a-only measurements, we do not consider this effect 
to be important. Finally, as previously argued, even if any 
of these effects are present they will affect each of our LSS 
samples in roughly the same proportion. 



A2 Systematics in the DS08 characterization 
method 

A particular source of concern is the fact that DS08 used 
different methods for obtaining the H I column densities 
and Doppler parameters in the catalog. They used curve- 
of-growth (COG) solutions when other Lyman series lines 
were available other than Ly-a. For the rest of the systems 
they used either a single Voigt profile fit (preferentially for 
strong or blende d lines) and/or the appar ent optical depth 
method (AODM. ISavaee fc Sembachlll99ll . preferentially for 
weak, asymmetric or noisy lines). We will refer to these as 
Ly-a-only methods. Given that we have been using systems 



A3 Observables as a function of redshift 

We also checked that no bias is present in our samples as a 
function of redshift. Figure IX2I shows both Ahi and 6hi val- 
ues as a function of redshift (middle and bottom panels). 
Systems belonging to different LSS samples are shown by 
different color /symbols. Both distributions look very uni- 
form across the full redshift range. The top panel shows the 
distribution of void, void-edge and unclassified absorbers as 
a function of redshift. The KS test shows no significant dif- 
ference between these LSS samples. We conclude that there 
is no evident systematic effect as a function of redshift. For 
simplicity we have only used LSS definitions based on X 
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Figure A2. Distribution of observables as a function of redshift. 
Middle and bottom panels show the distribution of A'^hi ^^<^ 
bill respectively. Different LSS samples are shown by different 
color /symbols: void (black circles), void-edge (red squares) and 
unclassified (blue triangles). Top panel shows the distribution of 
different LSS as a function of redshift: void (black-solid line), 
void-edge (red-dashed line) and unclassified (blue-dotted line). 
For simplicity we only show LSS definitions based on X. 



in Figure IA21 but the previous results also hold using D 
instead. 

The catalog of P12 used a nearly complete, magnitude 
limited sample of galaxies to define the voids. Despite this, 
we performed an independent check by looking at the mean 
radius (i?void) as a function of redshift. If the catalog is well 
defined, we should expect to have this radius constant across 
redshift range (assuming no measurable evolution). We con- 
firmed that this is actually the case by dividing the sample in 
6 redshift bins and measuring the mean value. We found that 
the mean radius is constant with (iivoid) « 13±3 /i~^Mpc in 
each bin. 

This paper has been typeset from a T^j]X/ ffl^jX file prepared 
by the author. 



